FTST Journal | ANALYSIS OF UNSTRUCTURED DATASET USING AN ENHANCED DESCRIPTIVE MINING ALGORITHM

FUW TRENDS IN SCIENCE & TECHNOLOGY JOURNAL

ANALYSIS OF UNSTRUCTURED DATASET USING AN ENHANCED DESCRIPTIVE MINING ALGORITHM
Pages: 673-678
A. R. Ajiboye, C. Umezuruike and R. E. Peter
53FTSTJ062020

Keywords: clustering algorithm, unstructured dataset, classification, descriptive mining, data dictionary

Abstract

Grouping large unstructured datasets is one of the main tasks in cluster analysis. A dataset is unstructured if it contains a mix of data types whose pattern makes it difficult to search or partition. An unstructured dataset is difficult to classify because it has no defined schema. This study proposes an Enhanced Descriptive Mining Algorithm (EDMA) and uses it to group the instances in the input space into a number of clusters. The aim of the study is to partition a given unstructured dataset into its constituent distinct features and to analyse them. To achieve this objective, the proposed EDMA was implemented together with a data dictionary created within the program to support the analysis; the implementation was carried out in the Java programming language. The unstructured dataset taken as input was retrieved from an open repository and comprised numeric, alphabetic and some special characters. The resulting output shows well-clustered data, partitioned according to similarity of features. The performance of the proposed technique was determined by evaluating its effectiveness, on a number of metrics, against two existing techniques: k-means and EM clustering. The findings show that the proposed technique is reliable, accurate, and well suited to clustering unstructured datasets.
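The abstract does not give the details of EDMA, so the following is only a minimal Java sketch of the general idea it describes: partitioning mixed input into numeric, alphabetic and special-character groups with the help of a small lookup that plays the role of a data dictionary. The class name `CharClassPartitioner`, the methods `categoryOf` and `partition`, and the category labels are all hypothetical and are not taken from the paper.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical illustration only: this is NOT the authors' EDMA.
class CharClassPartitioner {

    // Assumed stand-in for a "data dictionary": maps a character to a category label.
    static String categoryOf(char c) {
        if (Character.isDigit(c)) return "numeric";
        if (Character.isLetter(c)) return "alphabetic";
        if (Character.isWhitespace(c)) return "whitespace";
        return "special";
    }

    // Groups the characters of the input by category, keeping the order in which
    // categories and characters are first encountered. Whitespace is skipped.
    static Map<String, List<Character>> partition(String input) {
        Map<String, List<Character>> clusters = new LinkedHashMap<>();
        for (char c : input.toCharArray()) {
            String label = categoryOf(c);
            if (label.equals("whitespace")) {
                continue;
            }
            clusters.computeIfAbsent(label, k -> new ArrayList<>()).add(c);
        }
        return clusters;
    }

    public static void main(String[] args) {
        // Prints the groups found in a small mixed sample string.
        System.out.println(partition("ab12#c9$"));
    }
}
```

A rule-based grouping like this needs no distance metric or preset number of clusters, which is one way it differs from k-means and EM; whether EDMA works this way is not stated in the abstract.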

FTST Journal is published by Federal University Wukari.
